Semi-supervised model-based document clustering: A comparative study
نویسندگان
چکیده
منابع مشابه
Document Clustering Based On Semi-Supervised Term Clustering
The study is conducted to propose a multi-step feature (term) selection process and in semi-supervised fashion, provide initial centers for term clusters. Then utilize the fuzzy c-means (FCM) clustering algorithm for clustering terms. Finally assign each of documents to closest associated term clusters. While most text clustering algorithms directly use documents for clustering, we propose to f...
متن کاملTopic Oriented Semi-supervised Document Clustering
In our study on developing a text mining prototype system, it is needed to group documents according to author’s need. However, Traditional documents clustering are usually considered an unsupervised learning. It cannot effectively group documents under user’s need. To solve this problem, we propose a new documents clustering approach. The main contributions include: (1) Describes user’s need b...
متن کاملUser-Interest-Based Document Filtering via Semi-supervised Clustering
This paper studies the task of user-interest-based document filtering, where users target to find some documents of a specific topic among a large document collection. This is usually done by a text categorization process, which divides all the documents into two categorizes: one containing all the desired documents (called positive documents) and the other containing all the other documents (c...
متن کاملMedline Document Clustering with Semi-Supervised Spectral Clustering Algorithm
To clustering biomedical documents, three different types of information’s are used. They are local content (LC),global content(GC) and mesh semantic(MS).In previous method only one are two types of information are cluster using Constraints and distance based algorithm. But in proposed system we used Semi Supervised clustering algorithm. It made most of the noisy constraints to improve clusteri...
متن کاملComparative Study on Context-Based Document Clustering
Clustering is an automatic learning technique aimed at grouping a set of objects into subsets or clusters. Objects in the same cluster should be as similar as possible, whereas objects in one cluster should be as dissimilar as possible from objects in the other clusters. Document clustering has become an increasingly important task in analysing huge documents. The challenging aspect to analyse ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Learning
سال: 2006
ISSN: 0885-6125,1573-0565
DOI: 10.1007/s10994-006-6540-7